Modeling the spatial layout of images beyond spatial pyramids

Authors

  • Jorge Sánchez
  • Florent Perronnin
  • Teófilo Emídio de Campos
Abstract

Several state-of-the-art image representations consist of averaging local statistics computed from patch-level descriptors. Boureau et al. have shown that such average statistics suffer from two sources of variance. The first comes from the fact that a finite set of local statistics is averaged. The second is due to the variation in the proportion of object-dependent information between different images of the same class. For object classification, these sources of variance reduce accuracy because they increase the overlap between class-conditional probabilities. Our goal is to include information about the spatial layout of images in image signatures based on average statistics. We show that the traditional approach to including the spatial layout, the Spatial Pyramid (SP), increases the first source of variance while only weakly reducing the second one. We therefore propose two complementary approaches to account for the spatial layout which are compatible with our goal of variance reduction. The first one models the spatial layout in an image-independent manner (as is the case for the SP) while the second one adapts to the image content. A significant benefit of these approaches with respect to the SP is that they do not increase the dimensionality of the image signature. We show the benefits of our approach on PASCAL VOC 2007, 2008 and 2009.

Most of this work was done while J. Sánchez was at CIII, Universidad Tecnológica Nacional, Facultad Regional Córdoba, X5000HUA, Córdoba, Argentina. Preprint submitted to Pattern Recognition Letters, July 25, 2012.
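The abstract's two remarks about the Spatial Pyramid (that it enlarges the signature and that each average is taken over fewer local statistics) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `average_pool` and `spatial_pyramid_pool`, the 1x1 plus 2x2 grid, the random "patch statistics" and the 4-dimensional descriptor size are all assumptions chosen for illustration, and the two approaches proposed in the paper are not reproduced here.

```python
import random


def average_pool(stats):
    """Average a non-empty list of equal-length local statistics."""
    n = len(stats)
    dim = len(stats[0])
    return [sum(s[k] for s in stats) / n for k in range(dim)]


def spatial_pyramid_pool(patches, dim, grid=(1, 2)):
    """Concatenate per-cell averages over a pyramid of g x g grids.

    patches: list of (x, y, stats) with x, y in [0, 1) (hypothetical layout).
    grid:    pyramid levels; level g splits the image into g x g cells.
    Returns the concatenated signature and the patch count of each cell.
    """
    signature, counts = [], []
    for g in grid:
        for row in range(g):
            for col in range(g):
                cell = [s for (x, y, s) in patches
                        if int(x * g) == col and int(y * g) == row]
                counts.append(len(cell))
                # An empty cell contributes a zero vector (an assumption).
                signature.extend(average_pool(cell) if cell else [0.0] * dim)
    return signature, counts


random.seed(0)
dim = 4  # illustrative size of one local statistic
patches = [(random.random(), random.random(),
            [random.random() for _ in range(dim)])
           for _ in range(200)]

global_sig = average_pool([s for _, _, s in patches])
sp_sig, counts = spatial_pyramid_pool(patches, dim, grid=(1, 2))

print(len(global_sig))  # 4: one average over all 200 patches
print(len(sp_sig))      # 20: 4 * (1 + 4) cells
print(counts)           # the 2x2 cells each average far fewer than 200 patches
```

With these settings the pyramid signature is five times longer than the global average, and each 2x2 cell averages only a fraction of the 200 patches, which is the finite-sample effect the abstract calls the first source of variance.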

Similar resources

Influence of initial spatial layout on seismic behavior of masonry buildings with curved roof systems

Early design decisions made on building configuration and spatial design affect the seismic behavior of buildings. Therefore, introducing design guidelines and empirical methods to assess the seismic behavior of buildings has been proposed as an appropriate approach. Such a concept helps architects to take into consideration how their preliminary design decisions influence downstream...

Introducing An Efficient Set of High Spatial Resolution Images of Urban Areas to Evaluate Building Detection Algorithms

The present work aims to introduce an efficient set of high spatial resolution (HSR) images in order to more fairly evaluate building detection algorithms. The introduced images are chosen from two recent HSR sensors (QuickBird and GeoEye-1) and based on several challenges of urban areas encountered in building detection such as diversity in building density, building dissociation, building sha...

Compact, Adaptive and Discriminative Spatial Pyramid for Improved Scene and Object Classification

The release of challenging datasets with a vast number of images requires the development of efficient image representations and algorithms which are able to manipulate these large-scale datasets efficiently. Nowadays the Bag-of-Words (BoW) based image representation is the most successful approach in the context of object and scene classification tasks. However, its main drawback is the absenc...

DBRIS at ImageCLEF 2012 Photo Annotation Task

For our participation in the ImageCLEF 2012 Photo Annotation Task we develop an image annotation system and test several combinations of SIFT-based descriptors with BoW-based image representations. Our focus is on the comparison of two image representation types which include spatial layout: the spatial pyramids and the visual phrases. The experiments on the training and test set show that ima...

Modeling the potential of Sand and Dust Storm sources formation using time series of remote sensing data, fuzzy logic and artificial neural network (A Case study of Euphrates basin)

Due to the differences between visible and thermal infrared images, the combination of these two types of images leads to a better understanding of the characteristics of targets and the environment. Thermal infrared images are really useful in distinguishing targets from the background based on the radiation differences and land surface temperature (LST) calculation. However, their spatial resolu...

Journal title:
  • Pattern Recognition Letters

Volume 33  Issue 

Pages  -

Publication date 2012